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GREGORIO MALAJOVICH 

Abstract. An apriori bound for the condition number associated to each of the 
following problems is given: general linear equation solving, minimum squares, non- 
symmetric eigenvalue problems, solving univariate polynomials, solving systems of 
multivariate polynomials. It is assumed that the input has integer coefficients and 
is not on the degenerate locus of the respective problem (i.e. the condition number 
is finite). Then condition numbers are bounded in terms of the dimension and of 
the bit-size of the input. 

In the same setting, bounds are given for the speed of convergence of the following 
iterative algorithms: QR without shift for the symmetric eigenvalue problem, and 
Graeffe iteration for univariate polynomials. 



1. Introduction 

In most of the numerical analysis literature, complexity and stability of numerical 
algorithms are usually estimated in terms of the problem instance dimension and of 
a 'condition number'. 

For instance, the complexity of solving an n x n linear system Ax = b is usually 
estimated in terms of the dimension n (actually the input size is n{n + 1)) and of the 
condition number ^(^4) = || A \\^ \\ A^^ . 

There is a set of problems instances with k{A) = oo, and in most cases it makes no 
sense to attempt solving those problem instances. There are also problem instances 
(in our case, matrices) close to the locus of degenerate problem instances. Those will 
have a large condition number, and will be said to be ill-conditioned. 

It is usually accepted that ill-conditioned problem instances are hard to solve. 
Thus, for complexity purposes a problem instance with a large condition number 
should be considered 'large'. Therefore, when considering problems defined for real in- 
puts, a reasonable measure for the input size would be (in our example): log2 k{A). 



(Compare to [T^ Formula 2.1 and paragraph below. See also the discussion in [|l| 



Chapter 3, Section 1). 

Another tradition, derived from classical complexity theory and pervasive in several 
branches of literature (such as linear programming), is to consider the subset of 
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problems instances with integer coefficients. Hence the input size is the number of 
coefficients times the bit-size of the largest coefficient (in absolute value). 

In this paper, the following classical problems of numerical analysis are considered: 

1. Solving a general n x n system of linear equations. 

2. Minimal squares problem for a full-rank matrix. 

3. Non-symmetric eigenvalue problem. 

4. Solution of one univariate polynomial. 

5. Solution of a non-degenerate system of n polynomial equations in n variables. 

All those problems share the feature mentioned above: there is a degenerate locus, 
and problem instances with real coefficients can be as close to the degenerate locus 
as wished. This implies that they can be arbitrarily ill-conditioned. 

However, in Theorems to ^ below, we provide bounds for the condition number 
of problems instances with integer coefficients and not in the degenerate locus. Those 
bounds depend on the dimension (size) of the problem instance and on the bit-size 
of its coefficients. 

In the analysis of iterative algorithms, one further considers a certain quantity that 
can be used to bound the speed of convergence and hence the number of iterations 
to obtain a given approximation. For instance, for power methods (or QR iteration 
without shift) in the symmetric eigenvalue problem, one can bound the number of 
steps in terms of the desired accuracy and of the ratio between different eigenvalues. 
The farther this number is from 1, the faster is the convergence. 

Once again, if input has real coefficients, this quantity can be arbitrarily close to 
1. However, explicit bounds for that quantity will be given for inputs with integer 
coefficients for 

6. QR iteration without shift for the Symmetric Eigenvalue Problem. 

7. Graeffe iteration for solving univariate polynomials. 

The reader should be warned that the results herein are worst case estimates, and 
are overly pessimistic for application purposes. The main motivation for those results 
is to convert numerical analysis estimates into 'polynomial time' estimates, not the 
opposite. 



2. Statement of main results 



Notation. 



2 stands for the 2-norm: if a; G M" or C", then 




If v4 is a matrix, then 



= sup 
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2.1. Linear equation solving. The first problem considered is linear equation solv- 
ing: given an n X n matrix A and a vector 6 G M", find x G MJ^ such that Ax = b. 
Its condition number (with respect to the 2-norm) is defined as 

k{A)= \\A\\, \\A''\\^ . 

Comprehensive treatment of the perturbation theory for this problem can be found 
in the literature, such as ||^ Section 2.2, Chapter 7, Lecture 12, etc... 

Theorem 1. Let A be an n x n matrix with integer coefficients. If A is invertible, 
then 

k{A) < I max \Aij 



No originality is claimed for Theorem |I[ This result is included for completeness 
and because its proof is elementary, yet illustrates the principle behind the other 
results. 

2.2. Minimal squares. The second problem in the list is minimal squares fitting. 
Let A be an m X n matrix, m > n, with full rank, and let b G M™. One has to find 
X to minimize || Ax — bW^'^. 

Let r = Ax — b he the residual, we are minimizing || r ^. Let 

" r " 



sin 6 
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According to |§| p. 117 (Compare to [Q Lecture 18 and 0] Section 19.1), the 
condition number of the linear least squares problem is 

2k(A) 

kls{A, b) = + tan 9 K{Af . 

COSU 

Since we do not assume A to be square, we need to give a new definition for k{A). 
Let a"MAx(^) and crMiN(^) be respectively the largest and the smallest singular values 
of A. Then set 

, / O-MAx(^) 

o'MmyA) 

When m = n, this definition is equal to the previous one. 

The singular locus is now the set of pairs {A, b) such that A does not have full rank 
(i.e. o"min(^) = 0) or such that || r = || ^ II2 (i-e. b is orthogonal to the image of 
A). 

The result is: 

Theorem 2. Let A be an m x n matrix with integer coefficients, and assume that A 
has full rank. Let b G Z™. Set H = maxjj (|Ail, \bi\)- Then if b is not orthogonal to 
the image of A, we have: 

^Ls{A,b) < Snt+^m'^+^i/^n+i _ 
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2.3. Non-symmetric eigenvalue problem. Let A be an n x n matrix and let A 
be a single eigenvalue of A. The condition number of A depends on the angle between 
the left and right eigenvectors: 

Let X, y be respectively right and left norm-1 eigenvectors of A associated to A: 
Ax = Xx, y*A = Ay*, and || x = || y II2 = 1- Then 

knse(^, A) = sec{x/y) = . 

y*x 

See Theorem 4.4 p. 149 for references. 

Theorem 3. Let A be an n x n matrix with integer coefficients, and let X be a single 
eigenvalue of A. Then 

KNSE{A,X)<n'-2'-{2V^HiA)Y'''-''' . 

2.4. Solving univariate polynomials. The condition number (in affine space) for 
solving a univariate polynomial /(x) = J2i=o fi^^ '^^^ be defined (0 page 228) as: 

where 

, , (Et„ ICP 

The degenerate locus is the set of polynomials with a multiple root or with a root 
at infinity. 

Theorem 4. Let f : x ^ SiLo /«^* ^ univariate polynomial with integer coeffi- 
cients, without multiple roots. Then 

M/)<22'^^-V^(max|/,|)^'^^ . 

2.5. Solving systems of polynomials. A similar condition number exists for sys- 
tems of polynomials. However, for the purpose of condition number theory, it is 
usually convenient to homogenize the equations and to study the perturbation the- 
ory of the 'roots' in complex projective space. This can also be seen as a change of 
metric, that simplifies the formula of the condition number and of several theorems 
(See [| Chapters 10, 12, 13). 

Let / = (/i, ■■■ , /„) be a system of polynomials in variables xi,--- We 
homogenize the system by multiplying each coefficient fijX"^ = fijx'l^x'^^ ■ ■ ■ x'^ of 
fi by Xq°, where we choose Jq = deg/j — (Ji + ■■■Jn)- We obtain a system of 
homogeneous polynomials in + 1 variables, that we call F = [Fi, ■ ■ ■ ,Fn). The 
natural space for the roots of F is projective space P", defined as the space of all 
'rays' 

(xo : ■ ■ ■ : Xn) = {{Xxo, ■ ■ ■ , Xxn) : A G C} . 
where not all equal to 0. 
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Every finite root (xi, ■ ■ ■ , x„) of / corresponds to tlie projective root of F given by 
[1 : Xi : ■ ■ ■ : and projective roots of F correspond either to a finite root of / or 
to a root 'at infinity'. 

Suppose tliat tlie coefficients of / (lience of F) are made to depend upon a pa- 
rameter t. The condition number bounds tlie absolute speed of the roots of F (in 
projective space) with respect to the absolute speed of the coefficients of F. Recall 
that the roots ^ of F are in projective space, so their speed vector C belongs to the 
tangent space T^P". 

The condition number of F at a root turns out to be: 

L-l 



F 



c 



c 



where ( e C"+^ is such that {(q : ■ ■ ■ : Cn) is a root of F (See Proposition 7c in Page 
230 of 0). We did not define the norm of a polynomial yet. Above, || . II2 stands 
for the unitary invariant norm (See |]T5[ Chapter III-7 or Section 12.1), that is the 
most reasonable generalization of the 2-norm to spaces of polynomials: 

Notation. Let G be a homogeneous degree d polynomial in n + 1 variables. Then 



G 



\ 



\Gj\^ 

d 
J 



where 



d 
J 



IS 



Jo\-JJ 



Let F be a system of homogeneous polynomials. Then 



F 



E 



With these definitions, the number fi{F, () is invariant under scalings of F, (, and 
under the action of the unitary group U{n + 1), where an element Q G SU{n + 1) 
acts hjQ:{F,()^{FoQ,QC). 

In order to define the condition number of a system of n equations in n variables, 
we set: 



max^(F,C) 



where C ranges over the roots of F. (Another possibility is to restrict ( to the non- 
degenerate roots of F. This would make no difference in this paper). The following 
theorem is true if one restricts ( to any subset of the roots of F. 

Theorem 5. Let f be a system ofn polynomial equations in n variables, with integer 
coefficients. We write H{f) for the maximum of the absolute value of the coefficients 
of f , S{f) for the number of non-zero coefficients of f and D for max di. Assume 
that fi{f) is finite. Then 

M<{{n + l)SHif)r" 
where c is an universal constant. 
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2.6. Symmetric eigenvalue problem. Let A be an n x n real positive symmetric 
matrix, and let Ai > A2 > ■ • ■ A„ > be its eigenvalues. 

Unlike the non-symmetric eigenvalue problem, the symmetric eigenvalue prob- 
lem has absolute condition number always equal to 1 (See Theorem 5.1. See 
also citePARLETT Fact 1.11 p. 16). 

However, when using an iterative algorithm, the ratio of eigenvalues 

p{A) = min ^ 

may play an important role for estimating convergence. For instance, according 
to Theorem 28.4, the QR algorithm without shift converges linearly with speed 
^jp^. Convergence may get slower when p{A) 1. Therefore one can bound the 
speed of convergence by bounding 

u{A) = p{A) - 1 = min ~ ^' 

j>i Xi 

above from zero. If ly^A) > 60, then p{A) > 1 + ^o- After k > iterations, one 

gets 

p{A)'' > 1 + Mo > 2 . 
Thus it suffices to perform 0{j- logg j-) iterations to obtain a result with accuracy 

Si. 

Also, the quantity //(A)"^ can also be interpreted as a condition number for the 
eigenvectors (See Theorem 5.7 p. 208). We will show here that 

Theorem 6. Let A be an n x n matrix with integer coefficients. Then 

-2-n? 



v{A)-^ > 8-"(4n)-"' (^max|yli^|^ 



2.7. Graeffe iteration. Let f : x ^ Yli=o fi^\ /d = 1 be a monic univariate 
polynomial with zeros Ci; ' ' ' iCd- Those zeros can be ordered such that 

ICil>IC2|>--->ICd|. 

The Graeffe operator maps the polynomial f{x) = Y[i=ii^~Ci) iiito the polynomial 

Gf{x) = (-i)^/(Vi)/(-vi) = nti(^ - C). 

In ||T0| , pj| , it is explained how to recover the actual roots of / after a certain 
number of Graeffe iterations, with a good approximation. The number of required 
iterations depends on the ratio: 

P(/) = , max . 

Unlike in Section |2.6| , we do not require here that the roots have different absolute 
value. We consider also the auxiliary quantity 

K/) = - 1 = max M_M . 

\(A>\(^\ \Q\ 
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By the above definitions, the 'condition number' zy(/)~^ is always finite. In order 
to recover the roots within relative precision 6, the number of Graeffe iterations to 
perform is 

0(log //(/)-!+ log rf + log log r^) . 

For clarity of exposition, we will show that bound under a special hypothesis: all 
the roots should be different positive real numbers. For the general case, see 
and 0. Also, all estimates here are 'up to the first order', and quadratic error terms 
will be discarded. 

After k steps of Graeffe iteration one obtains the polynomial 

d d 

^(x) = GV(x) = 5^<7.x^ = n(^-Cf) 

with p{g) = p{ff\ 



i=0 i=l 

\2'' 

Expanding each gi as the {d — z)-th elementary symmetric function of the Q'^ ^ one 
obtains 



90 = ^d(Ci Xd) 

ga-i = cri(Ci ,Cd ) 
9d = l 

We can use the special hypothesis to bound 



with \Si\ < + h.o.t. Hence 



9d~i 



With 15^1 <1^ + h.o.t. 

Since we assumed the Q are all positive, we can recover them by taking 2'^-th roots 

V 9d-i J 

with \5''\ < + h.o.t. 

Now we can use the estimate on p{g) = p{,fY to deduce that 0(log /v(/)^^+log 5^^) 
steps are sufficient to obtain a relative precision 5 in the roots. Indeed after ki = 
log2Z^(/)"^ steps. 
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After extra k2 = log2((i + 1 + log2 S ^) steps, one gets 

So we can set A; = A;i + A;2 + 1, the last 1 to get rid of the high order terms, to 
deduce that \6'/\ < 6. 

Theorem 7. Let f : x ^ SiLo /«^* ^ polynomial with integer coefficients. Then 

z/(/)-'>(8max|/,|)-^^ . 

This says that Graeffe iteration is 'polynomial time', in the sense that we can 
obtain relative accuracy 5 of the roots after 

0((ilogmax + log log 

steps. 

3. Background material 

The proof of Theorems ^ to |^ will make use of the absolute multiplicative height 
function H to bound inequalities involving algebraic numbers. 

The construction of the height function H is quite standard in number theory and 
we refer the reader to @] Chapter II or to |12] pages 205-214. For applications to 
complexity theory, see [0] Chapter 7 and 0. 

The height function is naturally defined in the projectivization P"'(Q'*) of the alge- 
braic numbers Q**. It returns a real number > 1. We can also extend it to complex 
projective space P" by setting H{P) = oo when P ^ P"(Q^). We will adopt this 
convention in order to simplify the notation of domains and ranges. 

Also, if a; = (xi, ■ ■ ■ , x„) G C", we can define its height as H{x) = H{xi : ■ ■ ■ : Xn '■ 
1). 

We can also define the height of matrices, polynomials and systems of polynomials 
as the height of the vector of all the coefficients. 

The following properties of heights will be used in the sequel. First of all, we can 
explicitly write the height of a vector with integer coefficients as: 

Proposition 1. If u E Z", then H{u) = maxi<j<„ where \- \ is the standard 
absolute value. 

Proposition [I| follows from the construction of the height function. One immediate 
consequence is that if f e Q", then H{v) = max |mwj|, \m\ where m is the greatest 
common denominator of the fj's. 

We can use the following fact to bound the height of the roots of an integral 
polynomial: 

Proposition 2. If f : x ^ f{x) = Yli=o /«^* ^■^ non-zero polynomial with integer 
coefficients, and if x is a root of f , then H{x) < 2max|/j|. 
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Proposition ^ is Theorem 5 in [|^. Compare with Theorem 5.9 in |T^, where the 
coefficients of / are algebraic numbers. 

We can use a bound on the height to bound absolute values above and below: 

Proposition 3. Let K he an algebraic extension of Q, and let x & K , x ^ 0. Then 

The height of a vector and of its coordinates can be related by: 
Proposition 4. 

H{xi) < H{xi, ■■■ ,Xn)< H{xi)H{x2) ■ ■ ■ H{Xn) . 

Propositions |^ and ^ follow immediately from the construction of the height func- 
tion. The height function is invariant under permutation of coordinates, and also: 

Proposition 5. Let K he an algebraic extension of Q, and let g G GaliK-.Q]- Then 
for any x E K , H{g{x)) = H{x) 



Proposition ^ is Lemma 5.10 in [|12 
Proposition 6. Let 

F = (Fo,- ■ ■ : C"i X X ■ ■■ X C"^- ^ C"^+i 

pi^... ^pfc ^ F{P\--- ,P^) 

be a system of multi-homogeneous polynomials with algebraic coefficients, where each 
Fi has degree dj in variables PK Let the P^ be algebraic. Then 

H{F{P)) < {maxS{f,))H{F)H{Py' ■ ■ ■ H{P^Y*^ . 

In the case /c = 1, this is similar to Theorem 5.6 in ||T^ (where maxiS'(/j) is not 
given explicitly). For the general case see Theorem 4 in [[^. 

Proposition 7. Let 

G = {Gi,--- ,Gm) : C"i X X ■ ■ • X C"^ ^ 

Q\---,Q' ^ G{Q\---,Q'') 

be a system of polynomials with algebraic coefficients, where each Gi has degree at 
most dj in variables . Let the be algebraic. Then 

H{G{Q)) < {maxS{G,))H{G)H{Qy' ■ ■■H{Q^Y^ . 

This is Corollary 1 in |^. Some consequences of this are that H^Y^^^Xi) < 
n]\H{xi) and that i^dlLi^i) < ]\H{xi). 

The following fact follows also from the construction of heights: 

Proposition 8. If x is an algebraic number, 

H{x^) = H{xf . 

Also, it makes sense to bound the height of the roots of a system of polynomials 
with respect to the height, size and degree of the system. Corollary 6 in ||^ is: 
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Proposition 9. [Krick and Pardo] Let fi, ■ ■ ■ , fr f n be polynomials in Z[xi, ■ ■ • 
, Xr] of degree and height bounded by d> n and r], respectively, and let V denote the 
algebraic affine variety defined by: V = {x : fi{x) = ■ ■ ■ fr{x) = 0} . 
Then V has at most d"" isolated points, and their height verifies: 

log2//(P)Gd'^(")(log2r + log2r/) . 

4. Proof of Theorems 

Notation. If A is a real (resp. complex) matrix, then A* is the real (resp. complex) 
transpose of A, {A*)ij = Aji. The same convention will be used for vectors. 

The vectors of the canonical basis will be denoted by Ci = [1, 0, 0, ■ ■ ■ ]*, 62 = 
[0,1,0,---]*, etc... 

4.1. Proof of Theorem [l|. 

\\A\\2 = sup II Aw by definition 

< \uj\ II [Aij, ■ ■ ■ , Anj]* II 2 by triangular inequality 

j 

< y/nmax \\ [Aij, • • • , A^j]* II2 since || u II2 =1 

< n max | A^j \ 

ij 

Let A{i, u) be the matrix obtained by replacing the z-th column of A by the vector 
u. Then ii v = A~^u, Cramer's rule is: 

det A{i,u) 

Vi = 



det A 

Since A has integer coefficients and det A 7^ 0, one can always bound \vi\ < 
det A{i,u)\. By Hadamard inequality, this implies: 



n-l 



Therefore, 



\vi\ < II u II2 max II [Aij, ■■■ , Anj]* 
< II II2 (yn)"~^ ^max |y4y|^ 



v4 ^ II2 = sup II ^4 II2 by definition 

Il«ll2=l 

n-l 



< n2 \ max | A,,| 1 

Combining the bounds for || A II2 and || A~^ , one obtains: 

k{A) < ^max|y4y|^ . 
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4.2. Proof of Theorem In order to estimate k,{A), we write 

k{A) = ^k{A*A) 

In order to bound cos^, we use the assumption that b is not orthogonal to the 
image of A. Hence || A*b II2 > 1 and the 'normal equation' A* Ax = A*b implies: 



Therefore, 



A* Ax II2 > 1 . 



IIAMxIL 1 1 
cos 9 = " „ . „ > —— > 



MI2 11^112 

and and tan 9 are bounded above by H^/m. Putting all together, 



4.3. Proof of Theorem |§. 

Lemma 1. Let B he an n ^ n matrix with integer coefficients. Let p{t) = det(i? — 
tl) = J2Pii'- Then 

max|pj| < I 2y^ma,x\Bij\ I 

Proof of Lemma |i|. 

Pi = ^±detC 

c 

where C ranges over the (n — i) x (n — i) sub-matrices of B of the form Cki = BskSi 
for some 1 < Si < ■ ■ ■ < < n. Hence 

\Pi\ ^ ^ ) ^C"^ ^ 

\ / \ n—i 

n 



< ( . I ( \/n — % max I C, 



I I \ tj 

n 

< ( 2v^max \Bij\ 



□ 



Lemma 2. Let A be an n x n matrix with integer coefficients and let A be an eigen- 
value of A. Then 

H{\) < 2 ( 2v/nmax|Aj 
Proof of Lemma ^. Apply Proposition ^ to the polynomial p{t) from Lemma [l|. □ 
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Lemma 3. Let B he an n x n matrix with integer coefficients. Let q[t) = det{B — 
tl + telcn) = J2'lit'- Then 

max I Qi max |R- 

Proof of Lemma ^. Let p{t) = det(-B — tl) and let r{t) = det(i? — tl) where B is the 
(n — 1) X (n — 1) matrix obtained by deleting the n-th row and the n-th column of 
B. Then, by multi-linearity of the determinant, 

p{t) = q{t)±tr{t) , 

hence 

q{t)=pit)±tr{t) . 

Therefore, 

max \qi\ < max \pi \ + max |rj | 

< I 2y/nmax\Bij\ ] + I 2\/n — 1 max \ Bij\ ] (Lemma ffl) 



< 2{2y/nmax\B, 



1-3 



□ 



Lemma 4. Let A he an n x n matrix with integer coefficients. Let A he an isolated 
eigenvalue of A and let x he an eigenvector associated to X, Ax = Ax. Then 

H{xi : ■ ■ ■ : Xn) < n2'^ {2^/nmax\Aij\ j 

Proof of Lemma 0. Assume without loss of generality that the first n — 1 lines of 
A — XI are independent. Let Mi, . . . , Mj, . . . , M„ be the sub-matrices obtained 
from A — XI hj deleting the last line and the z-th column. Then we can scale x in 
such a way that 

Xi = ± det Mi . 

We have M„ = Bn — XI. By reordering rows and columns, we obtain for each i < n 
that Mi is of the form: 

Mi = B,-XI + Ae;_ie„_i 
where Bi is the sub-matrix of A obtained by deleting the last line and the i-th column. 



Set q^'\X) = det Mi = ^qfx^. Now by Lemma 



max|gj'^| < 2 ^y/n — 1 max |v4fc/|^ 



ra-l 
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We consider now the morphism: 

q: P P'^ 

(A:l) ^ (g«(A) : ■■■ : g(")(A)) 

Then x = q{X) and 

Hix) = HiqiX)) 

< nH{q)H{X)"-^ 

/ \ '1^1 / \ n(n— 1) 

< n2 ( 2y/n - 1 max | Ail ) 2""^ ( 2^/nmax | A 



< n2" ( 2^/nmaix\Ai 



the first inequahty because of Proposition and the second because of Lemma |^. □ 



End of the Proof of Theorem Proposition |^ imphes 

H{y*x) < nH{x)H{y) . 



We claim that deg[Q[y*2;] : Q] < n. Indeed, x and y can be obtained by solving 
systems of linear equations with coefficients in Q[A], thus Xi,yi G Q[A]. Also, Yi ^ 
= Q[A] so deg[Q[y*a;] : Q] < n as claimed. 
By hypothesis y*x ^ 0. Hence, by Proposition ^, 



\y*x\ > {nH{x)H{y)y 

> n-^"2-2" ( 2v^max|A 



-2n^+2n 



4.4. Proof of Theorem 

Proof. According to Proposition 0, 

/f(C) <2max|/,| . 

Also, 

H{f[)<dH{f) = dmax\f,\ 

and according to Proposition ^ 

Hif'iO) < dH{r)H{CY-' 
< d^2'^-^H{fY 



□ 
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and hence |/'(C)| > d~^'^H{f)-'^^2-^'^'^-^\ On the other hand, 



EICI'M < v^max(l,|C|') 



,i=0 



Hence 



< VdTl2'^'H{ff 



□ 



4.5. Proof of Theorem 

Lemma 5. Let A he an n x n invertible matrix with algebraic coefficients. Then 

H{A-^) < nH{AY . 

Proof of Lemma ^. Let A{i,j) be the sub-matrix of A obtained by deleting the i-th 

)ji 



row and the j-th column. By Cramer's rule, {A ^)^.. = Therefore we should 



define the degree n morphism: 

(All : Ai2 : ■ ■ ■ ^ 

■ ■ ■ • Ann '■ 1) 

Then by Proposition ^ 

H{<p{A)) < n\H{AYH{ip) 
< n\H{AY 



(detA(l,l) : dety4(l,2) : ■■■ 
■ ■ ■ : det A{n, n) : det A) 



□ 



Let us fix the notations 



M 



c 



1-di 



and 



C = DF(C)p4 



C\\ 
'5 112 



di-l 



■df(c)' 



c 



Let C G C"^ be a fixed representative for a root of F. Any u G Tj^P" can be 
written as a vector in C""*"^, orthogonal to Computing u = Cv is the same as 

V 





solving Mu 



The operator C is the same as (M ^)|^^^^=o- Therefore, 



CII2 < II M-^ 
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Lemma 6. In the conditions of Theorem 

H{M) < SV^TTl''^^ HiCf'-^DHif) . 
Proof of Lemma 6. We apply Proposition |^ to the system: 



F)F- 

C,A^-(---,iv*-^^(C),---,C„ 



with = II |L """to obtain 



2 

-D-l TJf AT\D-l 



H{M) < SH{C) H{N)''-'H{DF) . 

We can bound H{DF) < DH{F) and H{N) = if( || ( ) = ^//(E lOP)- We can 
apply Proposition ^ to the map 

if : C"+^ ^ C 

to get H{N^) <{n + 1)H{C)H{C). Proposition | implies H{C) = H{C), hence: 
and by Proposition & 



H{N) < V^TTiHic) ■ 

Thus, we can estimate that 

H{M) < SV^T+l'"'^ H{Cf''-^DH{f) . 



□ 



End of the Proof of Theorem By definition of the norm, II/II2 < SH{f). By 
Lemma ^ and Lemma 6, we have: 

H{C) < (n + l)i7(M)" 



Knowing that deg [Q[C] : Q] < -D", we can use Proposition |^ to deduce that 
IICII2 < {n + l)H{C) 

< (n + 1) (^{n + l)5"y^^Tr'''"i^(C)'"''~'"D"i/(/)"' 
According to Proposition ^, 

where c' is a universal constant. Thus, 
< ((n + l)5i/(/))^^" 
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where c is a universal constant. 



4.6. Proof of Theorem Lemma |^ imphes: 

H{X) < 2 (^2y/nmax \Aij^ . 

Hence (Proposition |^), 
Thus, by Proposition ^, 

2n2 



^-1 



< 8^^(471)"' (^maxlAjl^ 



□ 



4.7. Proof of Theorem 0. According to Proposition 

HiQ) < 2H{f) 

H{Q) < 2H{f) 

Moreover, if(|Ci|) < ^(G) because lOP = and H^Q) = H{Ci) (Propositions |, 
and BI). Thus, 



It follows that 

i^if)-' > (8i7(/))-''^^*'"''i'"'^iJ^^' > {8H{f)y 



5. Further comments 

As mentioned before, a reasonable definition for the 'real complexity' input size 
is the number of coefficients of a given problem instance, times the logarithm of its 
condition number. 

Theorems |l| to ^ show that the 'real complexity' input size is no worse than a 
polynomial of the 'classical complexity' input size, for problem instances with integer 
coefficients. Theorem ^ also, if one considers as part of the input size. It may be 
possible to replace by the Bezout number Y[ di, that is the number of solutions 
of a generic system of polynomials. 

Since the 'real complexity' of the problems considered can be bound by common 
numerical analysis techniques, those Theorems provide a scheme to convert 'real 
complexity' bounds into 'classical complexity' bounds. 

The same idea is behind Theorems ^ and ^ In the case of the iterative algorithms 
considered, the number of iterations for obtaining a certain approximation can also 
be bounded in terms of a 'condition number'. In the case of problem instances with 
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integer coefficients, the 'condition number' is also polynomially bounded in terms of 
the input size. 

Those Theorems have many features in common, and this is not a coincidence. 
A more general approach is to interpret the condition number as the inverse of the 
distance to the degenerate locus. This can be bounded in terms of the height of the 
problem instance, and in terms of the degenerate locus (degree, dimension, height). 
However, bound obtained this way will be no sharper and possibly worse than the 
direct bounds obtained by using the exact expression for the condition number. 

This paper was written while the author was visiting MSRI at Berkeley. He wishes 
to thank MSRI for its generous support. Thanks to Bernard Deconinck, Jennifer 
Roveno, Paul Gross, Raquel, and very special thanks to Paulo Ney de Souza and 
family. 
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